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hCEA-LTA Nucleotide Sequence 



1 ATGGAGTCTC CCTCGGCCCC TCCCCACAGA TGGTGCATCC CCTGGCAGAG
51 GCTCCTGCTC ACAGCCTCAC TTCTAACCTT CTGGAACCCG CCCACCACTG
101 CCAAGCTCAC TATTGAATCC ACGCCGTTCA ATGTCGCAGA GGGGAAGGAG
151 GTGCTTCTAC TTGTCCACAA TCTGCCCCAG CATCTTTTTG GCTACAGCTG
201 GTACAAAGGT GAAAGAGTGG ATGGCAACCG TCAAATTATA GGATATGTAA
251 TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGATA
301 ATATACCCCA ATGCATCCCT GCTGATCCAG AACATCATCC AGAATGACAC
351 AGGATTCTAC ACCCTACACG TCATAAAGTC AGATCTTGTG AATGAAGAAG
401 CAACTGGCCA GTTCCGGGTA TACCCGGAGC TGCCCAAGCC CTCCATCTCC
451 AGCAACAACT CCAAACCCGT GGAGGACAAG GATGCTGTGG CCTTCACCTG
501 TGAACCTGAG ACTCAGGACG CAACCTACCT GTGGTGGGTA AACAATCAGA
551 GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CAGGACCCTC
601 ACTCTATTCA ATGTCACAAG AAATGACACA GCAAGCTACA AATGTGAAAC
651 CCAGAACCCA GTGAGTGCCA GGCGCAGTGA TTCAGTCATC CTGAATGTCC
701 TCTATGGCCC GGATGCCCCC ACCATTTCCC CTCTAAACAC ATCTTACAGA
751 TCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA ACCCACCTGC
801 ACAGTACTCT TGGTTTGTCA ATGGGACTTT CCAGCAATCC ACCCAAGAGC
851 TCTTTATCCC CAACATCACT GTGAATAATA GTGGATCCTA TACGTGCCAA
901 GCCCATAACT CAGACACTGG CCTCAATAGG ACCACAGTCA CGACGATCAC
951 AGTCTATGCA GAGCCACCCA AACCCTTCAT CACCAGCAAC AACTCCAACC
1001 CCGTGGAGGA TGAGGATGCT GTAGCCTTAA CCTGTGAACC TGAGATTCAG
1051 AACACAACCT ACCTGTGGTG GGTAAATAAT CAGAGCCTCC CGGTCAGTCC
1101 CAGGCTGCAG CTGTCCAATG ACAACAGGAC CCTCACTCTA CTCAGTGTCA
1151 CAAGGAATGA TGTAGGACCC TATGAGTGTG GAATCCAGAA CGAATTAAGT
1201 GTTGACCACA GCGACCCAGT CATCCTGAAT GTCCTCTATG GCCCAGACGA
1251 CCCCACCATT TCCCCCTCAT ACACCTATTA CCGTCCAGGG GTGAACCTCA
1301 GCCTCTCCTG CCATGCAGCC TCTAACCCAC CTGCACAGTA TTCTTGGCTG
1351 ATTGATGGGA ACATCCAGCA ACACACACAA GAGCTCTTTA TCTCCAACAT
1401 CACTGAGAAG AACAGCGGAC TCTATACCTG CCAGGCCAAT AACTCAGCCA
1451 GTGGCCACAG CAGGACTACA GTCAAGACAA TCACAGTCTC TGCGGAGCTG
1501 CCCAAGCCCT CCATCTCCAG CAACAACTCC AAACCCGTGG AGGACAAGGA
1551 TGCTGTGGCC TTCACCTGTG AACCTGAGGC TCAGAACACA ACCTACCTGT
1601 GGTGGGTAAA TGGTCAGAGC GTCCCAGTCA GTCCCAGGCT GCAGCTGTCC
1651 AATGGCAACA GGACCCTCAC TCTATTCAAT GTCACAAGAA ATGACGCAAG
1701 AGCCTATGTA TGTGGAATCC AGAACTCAGT GAGTGCAAAC CGCAGTGACC
1751 CAGTCACCCT GGATGTCCTC TATGGGCCGG ACACCCCCAT CATTTCCCCC
1801 CCAGACTCGT CTTACCTTTC GGGAGCGAAC CTCAACCTCT CCTGCCACTC
1851 GGCCTCTAAC CCATCCCCGC AGTATTCTTG GCGTATCAAT GGGATACCGC
1901 AGCAACACAC ACAAGTTCTC TTTATCGCCA AAATCACGCC AAATAATAAC
1951 GGGACCTATG CCTGTTTTGT CTCTAACTTG GCTACTGGCC GCAATAATTC



FIG.2A-1 
2001 CATAGTCAAG AGCATCACAG TCTCTGCATC TGGAACTCTA GTTAATGGCG
2051 ACAAATTATA CCGTGCTGAC TCTAGACCCC CAGATGAAAT AAAACGTTCC
2101 GGAGGTCTTA TGCCCAGAGG GCATAATGAG TACTTCGATA GAGGAACTCA
2151 AATGAATATT AATCTTTATG ATCACGCGAG AGGAACACAA ACCGGCTTTG
2201 TCAGATATGA TGACGGATAT GTTTCCACTT CTCTTAGTTT GAGAAGTGCT
2251 CACTTAGCAG GACAGTCTAT ATTATCAGGA TATTCCACTT ACTATATATA
2301 TGTTATAGCG ACAGCACCAA ATATGTTTAA TGTTAATGAT GTATTAGGCG
2351 TATACAGCCC TCACCCATAT GAACAGGAGG TTTCTGCGTT AGGTGGAATA
2401 CCATATTCTC AGATATATGG ATGGTATCGT GTTAATTTTG GTGTAATTGA
2451 TGAACGATTA CATCGTAACA GGGAATATAG AGACCGGTAT TACAGAAATC
2501 TGAATATAGC TCCGGCAGAG GATGGTTACA GATTAGCAGG TTTCCCACCG
2551 GATCACCAAG CTTGGAGAGA AGAACCCTGG ATTCATCATG CACCACAAGG
2601 TTGTGGAAAT TCATCAAGAA CAATTACAGA TGATACTTGT AATGAGGAGA
2651 CCCAGAATCT GAGCACAATA TATCTCAGGA AATATCAATC AAAAGTTAAG
2701 AGGCAGATAT TTTCAGACTA TCAGTCAGAG GTTGACATAT ATAACAGAAT
2751 TCGGGATGAA TTATGA (SEQ ID NO:7)
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CEA-LTA Amino Acid Sequence 



1 MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE
51 VLLLVHNLPQ HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI
101 IYPNASLLIQ NIIQNDTGFY TLHVIKSDLV NEEATGQFRV YPELPKPSIS
151 SNNSKPVEDK DAVAFTCEPE TQDATYLWWV NNQSLPVSPR LQLSNGNRTL
201 TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP TISPLNTSYR
251 SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ
301 AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ
351 NTTYLWWVNN QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS
401 VDHSDPVILN VLYGPDDPTI SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL
451 IDGNIQQHTQ ELFISNITEK NSGLYTCQAN NSASGHSRTT VKTITVSAEL
501 PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS LPVSPRLQLS
551 NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP
601 PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN
651 GTYACFVSNL ATGRNNSIVK SITVSASGTL VNGDKLYRAD SRPPDEIKRS
701 GGLMPRGHNE YFDRGTQMNI NLYDHARGTQ TGFVRYDDGY VSTSLSLRSA
751 HLAGQSILSG YSTYYIYVIA TAPNMFNVND VLGVYSPHPY EQEVSALGGI
801 PYSQIYGWYR VNFGVIDERL HRNREYRDRY YRNLNIAPAE DGYRLAGFPP
851 DHQAWREEPW IHHAPQGCGN SSRTITDDTC NEETQNLSTI YLRKYQSKVK
901 RQIFSDYQSE VDIYNRIRDE L (SEQ ID NO:8)
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hCEA-LTB Coding Sequence 



1 ATGGAGTCTC CCTCGGCCCC TCCCCACAGA TGGTGCATCC CCTGGCAGAG
51 GCTCCTGCTC ACAGCCTCAC TTCTAACCTT CTGGAACCCG CCCACCACTG
101 CCAAGCTCAC TATTGAATCC ACGCCGTTCA ATGTCGCAGA GGGGAAGGAG
151 GTGCTTCTAC TTGTCCACAA TCTGCCCCAG CATCTTTTTG GCTACAGCTG
201 GTACAAAGGT GAAAGAGTGG ATGGCAACCG TCAAATTATA GGATATGTAA
251 TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGATA
301 ATATACCCCA ATGCATCCCT GCTGATCCAG AACATCATCC AGAATGACAC
351 AGGATTCTAC ACCCTACACG TCATAAAGTC AGATCTTGTG AATGAAGAAG
401 CAACTGGCCA GTTCCGGGTA TACCCGGAGC TGCCCAAGCC CTCCATCTCC
451 AGCAACAACT CCAAACCCGT GGAGGACAAG GATGCTGTGG CCTTCACCTG
501 TGAACCTGAG ACTCAGGACG CAACCTACCT GTGGTGGGTA AACAATCAGA
551 GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CAGGACCCTC
601 ACTCTATTCA ATGTCACAAG AAATGACACA GCAAGCTACA AATGTGAAAC
651 CCAGAACCCA GTGAGTGCCA GGCGCAGTGA TTCAGTCATC CTGAATGTCC
701 TCTATGGCCC GGATGCCCCC ACCATTTCCC CTCTAAACAC ATCTTACAGA
751 TCAGGGGAAA ATCTGAACCT CTCCTGCCAC GCAGCCTCTA ACCCACCTGC
801 ACAGTACTCT TGGTTTGTCA ATGGGACTTT CCAGCAATCC ACCCAAGAGC
851 TCTTTATCCC CAACATCACT GTGAATAATA GTGGATCCTA TACGTGCCAA
901 GCCCATAACT CAGACACTGG CCTCAATAGG ACCACAGTCA CGACGATCAC
951 AGTCTATGCA GAGCCACCCA AACCCTTCAT CACCAGCAAC AACTCCAACC
1001 CCGTGGAGGA TGAGGATGCT GTAGCCTTAA CCTGTGAACC TGAGATTCAG
1051 AACACAACCT ACCTGTGGTG GGTAAATAAT CAGAGCCTCC CGGTCAGTCC
1101 CAGGCTGCAG CTGTCCAATG ACAACAGGAC CCTCACTCTA CTCAGTGTCA
1151 CAAGGAATGA TGTAGGACCC TATGAGTGTG GAATCCAGAA CGAATTAAGT
1201 GTTGACCACA GCGACCCAGT CATCCTGAAT GTCCTCTATG GCCCAGACGA
1251 CCCCACCATT TCCCCCTCAT ACACCTATTA CCGTCCAGGG GTGAACCTCA
1301 GCCTCTCCTG CCATGCAGCC TCTAACCCAC CTGCACAGTA TTCTTGGCTG
1351 ATTGATGGGA ACATCCAGCA ACACACACAA GAGCTCTTTA TCTCGAACAT
1401 CACTGAGAAG AACAGCGGAC TCTATACCTG CCAGGCCAAT AACTCAGCCA
1451 GTGGCCACAG CAGGACTACA GTCAAGACAA TCACAGTCTC TGCGGAGCTG
1501 CCCAAGCCCT CCATCTCCAG CAACAACTCC AAACCCGTGG AGGACAAGGA
1551 TGCTGTGGCC TTCACCTGTG AACCTGAGGC TCAGAACACA ACCTACCTGT
1601 GGTGGGTAAA TGGTCAGAGC CTCCCAGTCA GTCCCAGGCT GCAGCTGTCC
1651 AATGGCAACA GGACCCTCAC TCTATTCAAT GTCACAAGAA ATGACGCAAG
1701 AGCCTATGTA TGTGGAATCC AGAACTCAGT GAGTGCAAAC CGCAGTGACC
1751 CAGTCACCCT GGATGTCCTC TATGGGCCGG ACACCCCCAT CATTTCCCCC
1801 CCAGACTCGT CTTACCTTTC GGGAGCGAAC CTCAACCTCT CCTGCCACTC
1851 GGCCTCTAAC CCATCCCCGC AGTATTCTTG GCGTATCAAT GGGATACCGC
1901 AGCAACACAC ACAAGTTCTC TTTATCGCCA AAATCACGCC AAATAATAAC
1951 GGGACCTATG CCTGTTTTGT CTCTAACTTG GCTACTGGCC GCAATAATTC
2001 CATAGTCAAG AGCATCACAG TCTCTGCATC TGGAACTCTA GATGCTCCCC
2051 AGTCTATTAC AGAACTATGT TCGGAATATC GCAACACACA AATATATACG
2101 ATAAATGACA AGATACTATC ATATACGGAA TCGATGGCAG GTAAAAGAGA
2151 AATGGTTATC ATTACATTTA AGAGCGGCGC AACATTTCAG GTCGAAGTCC
2201 CGGGCAGTCA ACATATAGAC TCCCAAAAAA AAGCCATTGA AAGGATGAAG
2251 GACACATTAA GAATCACATA TCTGACCGAG ACCAAAATTG ATAAATTATG
2301 TGTATGGAAT AATAAAACCC CCAATTCAAT TGCGGCAATC AGTATGGAAA
2351 ACTAG (SEQ ID NO:9)



FIG.3A-2 
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CEA-LTB Amino Acid Sequence 



1 MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE
51 VLLLVHNLPQ HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI
101 IYPNASLLIQ NIIQNDTGFY TLHVIKSDLV NEEATGQFRV YPELPKPSIS
151 SNNSKPVEDK DAVAFTCEPE TQDATYLWWV NNQSLPVSPR LQLSNGNRTL
201 TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP TISPLNTSYR
251 SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ
301 AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ
351 NTTYLWWVNN QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS
401 VDHSDPVILN VLYGPDDPTI SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL
451 IDGNIQQHTQ ELFISNITEK NSGLYTCQAN NSASGHSRTT VKTITVSAEL
501 PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS LPVSPRLQLS
551 NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP
601 PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN
651 GTYACFVSNL ATGRNNSIVK SITVSASGTL DAPQSITELC SEYRNTQIYT
701 INDKILSYTE SMAGKREMVI ITFKSGATFQ VEVPGSQHID SQKKAIERMK
751 DTLRITYLTE TKIDKLCVWN NKTPNSIAAI SMEN (SEQ ID NO: 10)
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CEAopt-LTB Nucleotide Sequence 

1 ATGGAGAGCC CCAGCGCCCC CCCCCACCGC TGGTGCATCC CCTGGCAGCG

CCTGCTGCTG ACCGCCAGCC TGCTGACCTT CTGGAACCCC CCCACCACCG 
101 CCAAGCTGAC CATCGAGAGC ACCCCCTTCA ACGTGGCCGA GGGCAAGGAG 

GTGCTGCTGC TGGTGCACAA CCTGCCCCAG CACCTGTTCG GCTACAGCTG 
201 GTACAAGGGC GAGCGCGTGG ACGGCAACCG CCAGATCATC GGCTACGTGA 

TCGGCACCCA GCAGGCCACC CCCGGCCCCG CCTACAGCGG CCGCGAGATC 
301 ATCTACCCCA ACGCCAGCCT GCTGATCCAG AACATCATCC AGAACGACAC 

CGGCTTCTAC ACCCTGCACG TGATCAAGAG CGACCTGGTG AACGAGGAGG 
401 CCACCGGCCA GTTCCGCGTG TACCCCGAGC TGCCCAAGCC CAGCATCAGC

AGCAACAACA GCAAGCCCGT GGAGGACAAG GACGCCGTGG CCTTCACCTG 
501 CGAGCCCGAG ACCCAGGACG CCACCTACCT GTGGTGGGTG AACAACCAGA 

GCCTGCCCGT GAGCCCCCGC CTGCAGCTGA GCAACGGCAA CCGCACCCTG 
601 ACCCTGTTCA ACGTGACCCG CAACGACACC GCCAGCTACA AGTGCGAGAC 

CCAGAACCCC GTGAGCGCCC GCCGCAGCGA CAGCGTGATC CTGAACGTGC 
701 TGTACGGCCC CGACGCCCCC ACCATCAGCC CCCTGAACAC CAGCTACCGC 

AGCGGCGAGA ACCTGAACCT GAGCTGCCAC GCCGCCAGCA ACCCCCCCGC 
801 CCAGTACAGC TGGTTCGTGA ACGGCACCTT CCAGCAGAGC ACCCAGGAGC 

TGTTCATCCC CAACATCACC GTGAACAACA GCGGCAGCTA CACCTGCCAG 
901 GCCCACAACA GCGACACCGG CCTGAACCGC ACCACCGTGA CCACCATCAC 

CGTGTACGCC GAGCCCCCCA AGCCCTTCAT CACCAGCAAC AACAGCAACC 
1001 CCGTGGAGGA CGAGGACGCC GTGGCCCTGA CCTGCGAGCC CGAGATCCAG 

AACACCACCT ACCTGTGGTG GGTGAACAAC CAGAGCCTGC CCGTGAGCCC 
1101 CCGCCTGCAG CTGAGCAACG ACAACCGCAC CCTGACCCTG CTGAGCGTGA 

CCCGCAACGA CGTGGGCCCC TACGAGTGCG GCATCCAGAA CGAGCTGAGC 
1201 GTGGACCACA GCGACCCCGT GATCCTGAAC GTGCTGTACG GCCCCGACGA 

CCCCACCATC AGCCCCAGCT ACACCTACTA CCGCCCCGGC GTGAACCTGA 
1301 GCCTGAGCTG CCACGCCGCC AGCAACCCCC CCGCCCAGTA CAGCTGGCTG 

ATCGACGGCA ACATCCAGCA GCACACCCAG GAGCTGTTCA TCAGCAACAT
1401 CACCGAGAAG AACAGCGGCC TGTACACCTG CCAGGCCAAC AACAGCGCCA 

GCGGCCACAG CCGCACCACC GTGAAGACCA TCACCGTGAG CGCCGAGCTG 
1501 CCCAAGCCCA GCATCAGCAG CAACAACAGC AAGCCCGTGG AGGACAAGGA 

CGCCGTGGCC TTCACCTGCG AGCCCGAGGC CCAGAACACC ACCTACCTGT 
1601 GGTGGGTGAA CGGCCAGAGC CTGCCCGTGA GCCCCCGCCT GCAGCTGAGC 

AACGGCAACC GCACCCTGAC CCTGTTCAAC GTGACCCGCA ACGACGCCCG 
1701 CGCCTACGTG TGCGGCATCC AGAACAGCGT GAGCGCCAAC CGCAGCGACC 

CCGTGACCCT GGACGTGCTG TACGGCCCCG ACACCCCCAT CATCAGCCCC 
1801 CCCGACAGCA GCTACCTGAG CGGCGCCAAC CTGAACCTGA GCTGCCACAG 

CGCCAGCAAC CCCAGCCCCC AGTACAGCTG GCGCATCAAC GGCATCCCCC 
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AGCAGCACAC CCAGGTGCTG TTCATCGCCA AGATCACCCC CAACAACAAC 
GGCACCTACG CCTGCTTCGT GAGCAACCTG GCCACCGGCC GCAACAACAG 
CATCGTGAAG AGCATCACCG TGAGCGCCAG CGGCACCTCT AGAGCTCCCC
AGACTATTAC AGAACTATGT TCGGAATATC GCAACACACA AATATATACG 
ATAAATGACA AGATACTATC ATATACGGAA TCGATGGCAG GCAAAAGAGA 
AATGGTTATC ATTACATTTA AGAGCGGCGA AACATTTCAG GTCGAAGTCC 
CGGGCAGTCA ACATATAGAC TCCCAGAAAA AAGCCATTGA AAGGATGAAG 
GACACATTAA GAATCACATA TCTGACCGAG ACCAAAATTG ATAAATTATG 
TGTATGGAAT AATAAAACCC CCAATTCAAT TGCGGCAATC AGTATGGAAA 
ACTAG (SEQ ID NO: 11) 



FIG.4B 



WO 2005/077977 



PCT/EP2005/001114 



10/62 

hCEA-LTBopt Coding Sequence 

1 ATGGAGAGCC CCAGCGCCCC CCCCCACCGC TGGTGCATCC CCTGGCAGCG 
CCTGCTGCTG ACCGCCAGCC TGCTGACCTT CTGGAACCCC CCCACCACCG 
101 CCAAGCTGAC CATCGAGAGC ACCCCCTTCA ACGTGGCCGA GGGCAAGGAG 
GTGCTGCTGC TGGTGCACAA CCTGCCCCAG CACCTGTTCG GCTACAGCTG 
201 GTACAAGGGC GAGCGCGTGG ACGGCAACCG CCAGATCATC GGCTACGTGA 
TCGGCACCCA GCAGGCCACC CCCGGCCCCG CCTACAGCGG CCGCGAGATC 
301 ATCTACCCCA ACGCCAGCCT GCTGATCCAG AACATCATCC AGAACGACAC 
CGGCTTCTAC ACCCTGCACG TGATCAAGAG CGACCTGGTG AACGAGGAGG 
401 CCACCGGCCA GTTCCGCGTG TACCCCGAGC TGCCCAAGCC CAGCATCAGC 
AGCAACAACA GCAAGCCCGT GGAGGACAAG GACGCCGTGG CCTTCACCTG 
501 CGAGCCCGAG ACCCAGGACG CCACCTACCT GTGGTGGGTG AACAACCAGA 
GCCTGCCCGT GAGCCCCCGC CTGCAGCTGA GCAACGGCAA CCGCACCCTG 
601 ACCCTGTTCA ACGTGACCCG CAACGACACC GCCAGCTACA AGTGCGAGAC 
CCAGAACCCC GTGAGCGCCC GCCGCAGCGA CAGCGTGATC CTGAACGTGC 
701 TGTACGGCCC CGACGCCCCC ACCATCAGCC CCCTGAACAC CAGCTACCGC 
AGCGGCGAGA ACCTGAACCT GAGCTGCCAC GCCGCCAGCA ACCCCCCCGC 
801 CCAGTACAGC TGGTTCGTGA ACGGCACCTT CCAGCAGAGC ACCCAGGAGC 
TGTTCATCCC CAACATCACC GTGAACAACA GCGGCAGCTA CACCTGCCAG 
901 GCCCACAACA GCGACACCGG CCTGAACCGC ACCACCGTGA CCACCATCAC 
CGTGTACGCC GAGCCCCCCA AGCCCTTCAT CACCAGCAAC AACAGCAACC 
1001 CCGTGGAGGA CGAGGACGCC GTGGCCCTGA CCTGCGAGCC CGAGATCCAG 
AACACCACCT ACCTGTGGTG GGTGAACAAC CAGAGCCTGC CCGTGAGCCC
1101 CCGCCTGCAG CTGAGCAACG ACAACCGCAC CCTGACCCTG CTGAGCGTGA 
CCCGCAACGA CGTGGGCCCC TACGAGTGCG GCATCCAGAA CGAGCTGAGC 
1201 GTGGACCACA GCGACCCCGT GATCCTGAAC GTGCTGTACG GCCCCGACGA 
CCCCACCATC AGCCCCAGCT ACACCTACTA CCGCCCCGGC GTGAACCTGA 
1301 GCCTGAGCTG CCACGCCGCC AGCAACCCCC CCGCCCAGTA CAGCTGGCTG 
ATCGACGGCA ACATCCAGCA GCACACCCAG GAGCTGTTCA TCAGCAACAT 
1401 CACCGAGAAG AACAGCGGCC TGTACACCTG CCAGGCCAAC AACAGCGCCA
GCGGCCACAG CCGCACCACC GTGAAGACCA TCACCGTGAG CGCCGAGCTG 
1501 CCCAAGCCCA GCATCAGCAG CAACAACAGC AAGCCCGTGG AGGACAAGGA 
CGCCGTGGCC TTCACCTGCG AGCCCGAGGC CCAGAACACC ACCTACCTGT 
1601 GGTGGGTGAA CGGCCAGAGC CTGCCCGTGA GCCCCCGCCT GCAGCTGAGC 
AACGGCAACC GCACCCTGAC CCTGTTCAAC GTGACCCGCA ACGACGCCCG 
1701 CGCCTACGTG TGCGGCATCC AGAACAGCGT GAGCGCCAAC CGCAGCGACC 
CCGTGACCCT GGACGTGCTG TACGGCCCCG ACACCCCCAT CATCAGCCCC 
1801 CCCGACAGCA GCTACCTGAG CGGCGCCAAC CTGAACCTGA GCTGCCACAG 
CGCCAGCAAC CCCAGCCCCC AGTACAGCTG GCGCATCAAC GGCATCCCCC 
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1901 AGCAGCACAC CCAGGTGCTG TTCATCGCCA AGATCACCCC CAACAACAAC 

GGCACCTACG CCTGCTTCGT GAGCAACCTG GCCACCGGCC GCAACAACAG 
2001 CATCGTGAAG AGCATCACCG TGAGCGCCAG CGGCACCTCT AGAGCCCCCC

AGAGCATCAC CGAGCTGTGC AGCGAGTACC GGAACACCCA GATCTACACC 
2101 ATCAACGACA AGATCCTGAG CTACACCGAG AGCATGGCCG GCAAGAGGGA 

GATGGTGATC ATCACCTTCA AGAGCGGCGC CACCTTCCAG GTGGAGGTGC 
2201 CCGGCAGCCA GCACATCGAC AGCCAGAAGA AGGCCATCGA GCGGATGAAG 

GACACCCTGC GGATCACCTA CCTCACCGAG ACCAAGATCG ACAAGCTGTG 
2301 CGTGTGGAAC AACAAGACCC CCAACAGCAT CGCCGCCATC AGCATGGAGA 

ATTGATAA (SEQ ID NO: 12) 



FIG.5A-2 
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hCEA-LTB Amino Acid Sequence 



1 MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE
51 VLLLVHNLPQ HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI
101 IYPNASLLIQ NIIQNDTGFY TLHVIKSDLV NEEATGQFRV YPELPKPSIS
151 SNNSKPVEDK DAVAFTCEPE TQDATYLWWV NNQSLPVSPR LQLSNGNRTL
201 TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP TISPLNTSYR
251 SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ
301 AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ
351 NTTYLWWVNN QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS
401 VDHSDPVILN VLYGPDDPTI SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL
451 IDGNIQQHTQ ELFISNITEK NSGLYTCQAN NSASGHSRTT VKTITVSAEL
501 PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS LPVSPRLQLS
551 NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP
601 PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN
651 GTYACFVSNL ATGRNNSIVK SITVSASGTS RAPQSITELC SEYRNTQIYT
701 INDKILSYTE SMAGKREMVI ITFKSGATFQ VEVPGSQHID SQKKAIERMK
751 DTLRITYLTE TKIDKLCVWN NKTPNSIAAI SMEN (SEQ ID NO: 13)
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RhCEAopt-LTBopt Amino Acid Sequence 

1 MGSPSAPLHR WCIPWQTLLL TASLLTFWNP PTTAQLTIES RPFNVAEGKE 

51 VLLLAHNVSQ NLFGYIWYKG ERVDASRRIG SCVIRTQQIT PGPAHSGRET 

101 IDFNASLLIH NVTQSDTGSY TIQVIKEDLV NEEATGQFRV YPELPKPYIS 

151 SNNSNPVEDK DAVALTCEPE TQDTTYLWWV NNQSLPVSPR LELSSDNRTL 

201 TVFNIPRNDT TSYKCETQNP VSVRRSDPVT LNVLYGPDAP TISPLNTPYR 

251 AGENLNLTCH AASNPTAQYF WFVNGTFQQS TQELFIPNIT VNNSGSYMCQ 

301 AHNSATGLNR TTVTAITVYA ELPKPYITSN NSNPIEDKDA VTLTCEPETQ 

351 DTTYLWWVNN QSLSVSSRLE LSNDNRTLTV FNIPRNDTTF YECETQNPVS 

401 VRRSDPVTLN VLYGPDAPTI SPLNTPYRAG ENLNLSCHAA SNPAAQYSWF 

451 VNGTFQQSTQ ELFIPNITVN NSGSYMCQAH NSATGLNRTT VTAITVYVEL 

501 PKPYISSNNS NPIEDKDAVT LTCEPVAENT TYLWWVNNQS LSVSPRLQLS 

551 NGNRILTLLS VTRNDTGPYE CGIQNSESAK RSDPVTLNVT YGPDTPIISP 

601 PDLSYRSGAN LNLSCHSDSN PSPQYSWLIN GTLRQHTQVL FISKITSNNS 

651 GAYACFVSNL ATGRNNSIVK NISVSSGDSS RAPQSITELC SEYRNTQIYT 

701 INDKILSYTE SMAGKREMVI ITFKSGATFQ VEVPGSQHID SQKKAIERMK 

751 DTLRITYLTE TKIDKLCVWN NKTPNSIAAI SMEN (SEQ ID NO: 15) 



FIG.6B 
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Nucleotide Sequence of First Rhesus Monkey CEA 



1 ATGGGGTCTC CCTCAGCCCC TCTTCACAGA TGGTGCATCC CCTGGCAGAC
51 GCTCCTGCTC ACAGCCTCAC TTCTAACCTT CTGGAACCCG CCCACCACTG
101 CCCAGCTCAC TATTGAATCC AGGCCGTTCA ATGTTGCAGA GGGGAAGGAG
151 GTTCTTCTAC TTGCCCACAA TGTGTCCCAG AATCTTTTTG GCTACATTTG
201 GTACAAGGGA GAAAGAGTGG ATGCCAGCCG TCGAATTGGA TCATGTGTAA
251 TAAGAACTCA ACAAATTACC CCAGGGCCCG CACACAGCGG TCGAGAGACA
301 ATAGACTTCA ATGCATCCCT GCTGATCCAC AATGTCACCC AGAGTGACAC
351 AGGATCCTAC ACCATACAAG TCATAAAGGA AGATCTTGTG AATGAAGAAG
401 CAACTGGCCA GTTCCGGGTA TACCCGGAGC TGCCCAAGCC CTACATCTCC
451 AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG CCTTAACCTG
501 TGAACCTGAG ACTCAGGACA CAACCTACCT GTGGTGGGTA AACAATCAGA
551 GCCTCCCGGT CAGTCCCAGG CTGGAGCTGT CCAGTGACAA CAGGACCCTC
601 ACTGTATTCA ATATTCCAAG AAATGACACA ACATCCTACA AATGTGAAAC
651 CCAGAACCCA GTGAGTGTCA GACGCAGCGA CCCAGTCACC CTGAACGTCC
701 TCTATGGCCC GGATGCGCCC ACCATTTCCC CTCTAAACAC ACCTTACAGA
751 GCAGGGGAAA ATCTGAACCT CACCTGCCAC GCAGCCTCTA ACCCAACTGC
801 ACAGTACTTT TGGTTTGTCA ATGGGACGTT CCAGCAATCC ACACAAGAGC
851 TCTTTATACC CAACATCACC GTGAATAATA GCGGATCCTA TATGTGCCAA
901 GCCCATAACT CAGCCACTGG CCTCAATAGG ACCACAGTCA CGGCGATCAC
951 AGTCTACGCG GAGCTGCCCA AGCCCTACAT CACCAGCAAC AACTCCAACC
1001 CCATAGAGGA CAAGGATGCT GTGACCTTAA CCTGTGAACC TGAGACTCAG
1051 GACACAACCT ACCTGTGGTG GGTAAACAAT CAGAGCCTCT CGGTCAGTTC
1101 CAGGCTGGAG CTGTCCAATG ACAACAGGAC CCTCACTGTA TTCAATATTC
1151 CAAGAAACGA CACAACGTTC TACGAATGTG AGACCCAGAA CCCAGTGAGT
1201 GTCAGACGCA GCGACCCAGT CACCCTGAAT GTCCTCTATG GCGCGGATGC
1251 GCCCACCATT TCCCCTCTAA ACACACCTTA CAGAGCAGGG GAAAATCTGA
1301 ACCTCTCCTG CCACGCAGCC TCTAACCCAG CTGCACAGTA CTGTTGGTTT
1351 GTCAATGGGA CGTTCCAGCA ATCCACACAA GAGCTCTTTA TAGCGAACAT
1401 CACCGTGAAT AATAGCGGAT CCTATATGTG CCAAGCCCAT AAGTCAGCCA
1451 CTGGCCTCAA TAGGACCACA GTCACGGCGA TCACAGTCTA TGTGGAGCTG
1501 CCCAAGCCCT ACATCTCCAG CAACAACTCC AACCCCATAG AGGACAAGGA
1551 TGCTGTGACC TTAACCTGTG AACCTGTGGC TGAGAACACA ACCTACCTGT
1601 GGTGGGTAAA CAATCAGAGC CTCTCGGTCA GTCCCAGGCT GCAGCTCTCC
1651 AATGGCAACA GGATCCTCAC TCTACTCAGT GTCACACGGA ATGACACAGG
1701 ACCCTATGAA TGTGGAATCC AGAACTCAGA GAGTGCAAAA CGCAGTGACC
1751 CAGTCACCCT GAATGTCACC TATGGCCCAG ACACCCCCAT CATATCCCCC
1801 CCAGACTTGT CTTACCGTTC GGGAGCAAAC CTCAACCTCT CCTGCCACTC
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1851 GGACTCTAAC CCATCCCCGC AGTATTCTTG GCTTATCAAT GGGACACTGC 

1901 GGCAACACAC ACAAGTTCTC TTTATCTCCA AAATCACATC AAACAATAGC 

1951 GGGGCCTATG CCTGTTTTGT CTCTAACTTG GCTACCGGTC GCAATAACTC 

2001 CATAGTCAAG AACATCTCAG TCTCCTCTGG CGATTCAGCA CCTGGAAGTT 

2051 CTGGTCTCTC AGCTAGGGCT ACTGTCGGCA TCATAATTGG AATGCTGGTT 

2101 GGGGTTGCTC TGATGTAG (SEQ ID NO: 16) 



FIG.7A-2 
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Nucleotide Sequence of Second Rhesus Monkey CEA 



1 ATGGGGTCTC CCTCAGCCCC TCTTCACAGA TGGTGCATCC CCTGGCAGAC
51 GCTCCTGCTC ACAGCCTCAC TTCTAACCTT CTGGAACCCG CCCACGACTG
101 CCCAGCTCAC TATTGAATCC AGGCCGTTCA ATGTTGCAGA GGGGAAGGAG
151 GTTCTTCTAC TTGCCCACAA TGTGTCCCAG AATCTTTTTG GCTACATTTG
201 GTACAAGGGA GAAAGAGTGG ATGCCAGCCG TCGAATTGGA TCATGTGTAA
251 TAAGAACTCA ACAAATTACC CCAGGGCCCG CACACAGCGG TCGAGAGACA
301 ATAGACTTCA ATGCATCCCT GCTGATCCAC AATGTCACCC AGAGTGACAC
351 AGGATCCTAC ACCATACAAG TCATAAAGGA AGATCTTGTG AATGAAGAAG
401 CAACTGGCCA GTTCCGGGTA TACCCGGAGC TGCCCAAGCC CTACATCTCC
451 AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG CCTTAACCTG
501 TGAACCTGAG ACTCAGGACA CAACCTACCT GTGGTGGGTA AACAATCAGA
551 GCCTCCCGGT CAGTCCCAGG CTGGAGCTGT CCAGTGACAA CAGGACCCTC
601 ACTGTATTCA ATATTCCAAG AAATGACACA ACATCCTACA AATGTGAAAC
651 CCAGAACCCA GTGAGTGTCA GACGCAGCGA CCCAGTCACC CTGAACGTCC
701 TCTATGGCCC GGATGCGCCC ACCATTTCCC CTCTAAACAC ACCTTACAGA
751 GCAGGGGAAA ATCTGAACCT CACCTGCCAC GCAGCCTCTA ACCCAACTGC
801 ACAGTACTTT TGGTTTGTCA ATGGGACGTT CCAGCAATCC ACACAAGAGC
851 TCTTTATACC CAACATCACC GTGAATAATA GCGGATCCTA TATGTGCCAA
901 GCCCATAACT CAGCCACTGG CCTCAATAGG ACCACAGTCA CGGCGATCAC
951 AGTCTACGCG GAGCTGCCCA AGCCCTACAT CACCAGCAAC AACTCCAACC
1001 CCATAGAGGA CAAGGATGCT GTGACCTTAA CCTGTGAACC TGAGACTCAG
1051 GACACAACCT ACCTGTGGTG GGTAAACAAT CAGAGCCTCT CGGTCAGTTC
1101 CAGGCTGGAG CTGTCCAATG ACAACAGGAC CCTCACTGTA TTCAATATTC
1151 CAAGAAACGA CACAACGTTC TACGAATGTG AGACCCAGAA CCCAGTGAGT
1201 GTCAGACGCA GCGACCCAGT CACCCTGAAT GTCCTCTATG GCCCGGATGC
1251 GCCCACCATT TCCCCTCTAA ACACACCTTA CAGAGCAGGG GAAAATCTGA
1301 ACCTCTCCTG CCACGCAGCC TCTAACCCAG CTGCACAGTA CTTTTGGTTT
1351 GTCAATGGGA CGTTCCAGCA ATCCACACAA GAGCTCTTTA TACCGAACAT
1401 CACCGTGAAT AATAGCGGAT CCTATATGTG CCAAGCCCAT AACTCAGCCA
1451 CTGGCCTCAA TAGGACCACA GTCACGGCGA TCACAGTCTA TGTGGAGCTG
1501 CCCAAGCCCT ACATCTCCAG CAACAACTCC AACCCCATAG AGGACAAGGA
1551 TGCTGTGACC TTAACCTGTG AACCTGTGGC TGAGAACACA ACCTACCTGT
1601 GGTGGGTAAA CAATCAGAGC CTCTCGGTCA GTCCCAGGCT GCAGCTCTCC
1651 AATGGCAACA GGATCCTCAC TCTACTCAGT GTCACACGGA ATGACACAGG
1701 ACCCTATGAA TGTGGAATCC AGAACTCAGA GAGTGCAAAA CGCAGTGACC
1751 CAGTCACCCT GAATGTCACC TATGGCCCAG ACACCCCCAT CATATCCCCC
1801 CCAGACTTGT CTTACCGTTC GGGAGCAAAC CTCAACCTCT CCTGCCACTC
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1851 GGACTCTAAC CCATCCCCGC AGTATTCTTG GCTTATCAAT GGGACACTGC 

1901 GGCAACACAC ACAAGTTCTC TTTATCTCCA AAATCACATC AAACAATAAC 

1951 GGGGCCTATG CCTGTTTTGT CTCTAACTTG GCTACCGGTC GCAATAACTC 

2001 CATAGTCAAG AACATCTCAG TCTCCTCTGG CGATTCAGCA CCTGGAAGTT 

2051 CTGGTCTCTC AGCTAGGGCT ACTGTCGGCA TCATAATTGG AATGCTGGTT 

2101 GGGGTTGCTC TGATGTAG (SEQ ID NO: 17) 
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Amino Acid Sequence of First Rhesus Monkey CEA Protein 

1 MGSPSAPLHR WCIPWQTLLL TASLLTFWNP PTTAQLTIES RPFNVAEGKE 

51 VLLLAHNVSQ NLFGYIWYKG ERVDASRRIG SCVIRTQQIT PGPAHSGRET 

101 IDFNASLLIH NVTQSDTGSY TIQVIKEDLV NEEATGQFRV YPELPKPYIS 

151 SNNSNPVEDK DAVALTCEPE TQDTTYLWWV NNQSLPVSPR LELSSDNRTL 

201 TVFNIPRNDT TSYKCETQNP VSVRRSDPVT LNVLYGPDAP TISPLNTPYR 

251 AGENLNLTCH AASNPTAQYF WFVNGTFQQS TQELFIPNIT VNNSGSYMCQ

301 AHNSATGLNR TTVTAITVYA ELPKPYITSN NSNPIEDKDA VTLTCEPETQ 

351 DTTYLWWVNN QSLSVSSRLE LSNDNRTLTV FNIPRNDTTF YECETQNPVS 

401 VRRSDPVTLN VLYGPDAPTI SPLNTPYRAG ENLNLSCHAA SNPAAQYSWF 

451 VNGTFQQSTQ ELFIPNITVN NSGSYMCQAH NSATGLNRTT VTAITVYVEL 

501 PKPYISSNNS NPIEDKDAVT LTCEPVAENT TYLWWVNNQS LSVSPRLQLS 

551 NGNRILTLLS VTRNDTGPYE CGIQNSESAK RSDPVTLNVT YGPDTPIISP 

601 PDLSYRSGAN LNLSCHSDSN PSPQYSWLIN GTLRQHTQVL FISKITSNNS 

651 GAYACFVSNL ATGRNNSIVK NISVSSGDSA PGSSGLSARA TVGIIIGMLV 

701 GVALM (SEQ ID NO: 18) 

FIG.7C 

Amino Acid Sequence of Second Rhesus Monkey CEA Protein 

1 MGSPSAPLHR WCIPWQTLLL TASLLTFWNP PTTAQLTIES RPFNVAEGKE

51 VLLLAHNVSQ NLFGYIWYKG ERVDASRRIG SCVIRTQQIT PGPAHSGRET 

101 IDFNASLLIH NVTQSDTGSY TIQVIKEDLV NEEATGQFRV YPELPKPYIS 

151 SNNSNPVEDK DAVALTCEPE TQDTTYLWWV NNQSLPVSPR LELSSDNRTL 

201 TVFNIPRNDT TSYKCETQNP VSVRRSDPVT LNVLYGPDAP TISPLNTPYR 

251 AGENLNLTCH AASNPTAQYF WFVNGTFQQS TQELFIPNIT VNNSGSYMCQ

301 AHNSATGLNR TTVTAITVYA ELPKPYITSN NSNPIEDKDA VTLTCEPETQ

351 DTTYLWWVNN QSLSVSSRLE LSNDNRTLTV FNIPRNDTTF YECETQNPVS 

401 VRRSDPVTLN VLYGPDAPTI SPLNTPYRAG ENLNLSCHAA SNPAAQYFWF 

451 VNGTFQQSTQ ELFIPNITVN NSGSYMCQAH NSATGLNRTT VTAITVYVEL 

501 PKPYISSNNS NPIEDKDAVT LTCEPVAENT TYLWWVNNQS LSVSPRLQLS 

551 NGNRILTLLS VTRNDTGPYE CGIQNSESAK RSDPVTLNVT YGPDTPIISP 

601 PDLSYRSGAN LNLSCHSDSN PSPQYSWLIN GTLRQHTQVL FISKITSNNN 

651 GAYACFVSNL ATGRNNSIVK NISVSSGDSA PGSSGLSARA TVGIIIGMLV 

701 GVALM (SEQ ID NO: 19) 



FIG.7D 






Amino Acid Sequence of Human CEA Protein

1 MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE
51 VLLLVHNLPQ HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI
101 IYPNASLLIQ NIIQNDTGFY TLHVIKSDLV NEEATGQFRV YPELPKPSIS
151 SNNSKPVEDK DAVAFTCEPE TQDATYLWWV NNQSLPVSPR LQLSNGNRTL
201 TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP TISPLNTSYR
251 SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ
301 AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ
351 NTTYLWWVNN QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS
401 VDHSDPVILN VLYGPDDPTI SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL
451 IDGNIQQHTQ ELFISNITEK NSGLYTCQAN NSASGHSRTT VKTITVSAEL
501 PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS LPVSPRLQLS
551 NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP
601 PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN
651 GTYACFVSNL ATGRNNSIVK SITVSASGTS PGLSAGATVG IMIGVLVGVA
701 LI (SEQ ID NO:20)
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MESP SAP PHR WCIP W Q R • 
1 ATGGAGAGCC CCAGCGCCCC CCCCCACCGC TGGTGCATCC CCTGGCAGCG 

•LLL TASL LTF WNP PTTA 
51 CCTGCTGCTG ACCGCCAGCC TGCTGACCTT CTGGAACCCC CCCACCACCG 

•KLT IES TPFN V A E GKE
101 CCAAGCTGAC CATCGAGAGC ACCCCCTTCA ACGTGGCCGA GGGCAAGGAG 

VLLL VHN LPQ HLFG Y S W • 
151 GTGCTGCTGC TGGTGCACAA CCTGCCCCAG CACCTGTTCG GCTACAGCTG 

■ Y K G ERVD GNR QII GYVI 
201 GTACAAGGGC GAGCGCGTGG ACGGCAACCG CCAGATCATC GGCTACGTGA 

G T Q QAT PGPA YSG REI 
251 TCGGCACCCA GCAGGCCACC CCCGGCCCCG CCTACAGCGG CCGCGAGATC 

IYPN ASL LIQ N I I Q N D T • 
301 ATCTACCCCA ACGCCAGCCT GCTGATCCAG AACATCATCC AGAACGACAC 

•GFY TLHV IKS DLV NEEA 
351 CGGCTTCTAC ACCCTGCACG TGATCAAGAG CGACCTGGTG AACGAGGAGG 

■TGQ FRV YPEL PKP SIS 
401 CCACCGGCCA GTTCCGCGTG TACCCCGAGC TGCCCAAGCC CAGCATCAGC 

SNNS KPV EDK DAVA FTC- 
451 AGCAACAACA GCAAGCCCGT GGAGGACAAG GACGCCGTGG CCTTCACCTG 

•EPE TQDA TYL WWV NNQS 
501 CGAGCCCGAG ACCCAGGACG CCACCTACCT GTGGTGGGTG AACAACCAGA 

•LPV SPR LQLS NGN RTL 
551 GCCTGCCCGT GAGCCCCCGC CTGCAGCTGA GCAACGGCAA CCGCACCCTG 

TLFN VTR NDT ASYK CET•
601 ACCCTGTTCA ACGTGACCCG CAACGACACC GCCAGCTACA AGTGCGAGAC 

■QNP VSAR RSD SVI LNVL 
651 CCAGAACCCC GTGAGCGCCC GCCGCAGCGA CAGCGTGATC CTGAACGTGC 

•YGP DAP TISP LNT SYR 
701 TGTACGGCCC CGACGCCCCC ACCATCAGCC CCCTGAACAC CAGCTACCGC 

SGEN LNL SCH AASN P P A • 
751 AGCGGCGAGA ACCTGAACCT GAGCTGCCAC GCCGCCAGCA ACCCCCCCGC 

•QYS WFVN GTF QQS TQEL 
801 CCAGTACAGC TGGTTCGTGA ACGGCACCTT CCAGCAGAGC ACCCAGGAGC 

•FIP NIT VNNS GSY TCQ 
851 TGTTCATCCC CAACATCACC GTGAACAACA GCGGCAGCTA CACCTGCCAG 

AHNS DTG LNR TTVT TIT' 
901 GCCCACAACA GCGACACCGG CCTGAACCGC ACCACCGTGA CCACCATCAC 

•VYA EPPK PFI TSN NSNP 
951 CGTGTACGCC GAGCCCCCCA AGCCCTTCAT CACCAGCAAC AACAGCAACC 

•VED EDA VALT CEP EIQ 
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1001 CCGTGGAGGA CGAGGACGCC GTGGCCCTGA CCTGCGAGCC CGAGATCCAG 

NTTY LWW VNN QSLP V S P • 
1051 AACACCACCT ACCTGTGGTG GGTGAACAAC CAGAGCCTGC CCGTGAGCCC 

•RLQ LSND NRT LTL L S V T • 
1101 CCGCCTGCAG CTGAGCAACG ACAACCGCAC CCTGACCCTG CTGAGCGTGA 

R N D VGP YECG IQN ELS 
1151 CCCGCAACGA CGTGGGCCCC TACGAGTGCG GCATCCAGAA CGAGCTGAGC 

VDHS DPV ILN VLYG P D D • 
1201 GTGGACCACA GCGACCCCGT GATCCTGAAC GTGCTGTACG GCCCCGACGA 

•PTI SPSY TYY RPG V N L S - 
1251 CCCCACCATC AGCCCCAGCT ACACCTACTA CCGCCCCGGC GTGAACCTGA 

•LSC HAA SNPP AQY SWL 
1301 GCCTGAGCTG CCACGCCGCC AGCAACCCCC CCGCCCAGTA CAGCTGGCTG 

IDG N IQQ HTQ ELFI S N I • 
1351 ATCGACGGCA ACATCCAGCA GCACACCCAG GAGCTGTTCA TCAGCAACAT 

• T E K NSGL YTC QAN N S A S • 
1401 CACCGAGAAG AACAGCGGCC TGTACACCTG CCAGGCCAAC AACAGCGCCA 

G H S RTT VKTI TVS AEL 
1451 GCGGCCACAG CCGCACCACC GTGAAGACCA TCACCGTGAG CGCCGAGCTG 

PKPS ISS NNS KPVE D K D • 
1501 CCCAAGCCCA GCATCAGCAG CAACAACAGC AAGCCCGTGG AGGACAAGGA 

•AVA FTCE PEA QNT TYLW- 
1551 CGCCGTGGCC TTCACCTGCG AGCCCGAGGC CCAGAACACC ACCTACCTGT 

•WVN GQS LPVS PRL QL S 
1601 GGTGGGTGAA CGGCCAGAGC CTGCCCGTGA GCCCCCGCCT GCAGCTGAGC 

NGNR TLT L F N VTRN D A R •
1651 AACGGCAACC GCACCCTGAC CCTGTTCAAC GTGACCCGCA ACGACGCCCG

• A Y V CGIQ NSV SAN R S D P •
1701 CGCCTACGTG TGCGGCATCC AGAACAGCGT GAGCGCCAAC CGCAGCGACC

• V T L DVL YGPD TPI I S P
1751 CCGTGACCCT GGACGTGCTG TACGGCCCCG ACACCCCCAT CATCAGCCCC

PDSS YLS G A N L N L S C H S •
1801 CCCGACAGCA GCTACCTGAG CGGCGCCAAC CTGAACCTGA GCTGCCACAG

• A S N PSPQ YSW R I N G I P Q •
1851 CGCCAGCAAC CCCAGCCCCC AGTACAGCTG GCGCATCAAC GGCATCCCCC

•QHT QVL FIAK ITP N N N 
1901 AGCAGCACAC CCAGGTGCTG TTCATCGCCA AGATCACCCC CAACAACAAC 

GTYA CFV SNL ATGR NNS- 
1951 GGCACCTACG CCTGCTTCGT GAGCAACCTG GCCACCGGCC GCAACAACAG 

• I V K SITV SAS GTS RSTP- 
2001 CATCGTGAAG AGCATCACCG TGAGCGCCAG CGGCACCTCT AGAAGCACCC

•IPF SYS KNLD CWV DNE 
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2051 CCATCCCATT CAGCTACAGC AAGAACCTGG ACTGCTGGGT GGACAACGAG 

EDID VIL KKS TILN L D I - 
2101 GAGGACATCG ACGTGATCCT GAAGAAGAGC ACCATCCTGA ACCTGGACAT 
•NND I I S D ISG F N S SVIT 
2151 CAACAACGAC ATCATCAGCG ACATCAGCGG CTTCAACAGC AGCGTGATCA 

Y P D AQL VPGI N G K A I H 
2201 CCTACCCCGA CGCCCAGCTG GTGCCCGGCA TCAACGGCAA GGCCATCCAC 

L V N N ESS EVI VHKA M D I - 
2251 CTGGTGAACA ACGAGAGCAG CGAGGTGATC GTGCACAAGG CCATGGACAT 
•EYN DMFN NFT VSF WLRV 
2301 CGAGTACAAC GACATGTTCA ACAACTTCAC CGTGAGCTTC TGGCTGAGAG 

- PKV SAS HLEQ YGT NEY 
2351 TGCCTAAGGT GAGCGCCAGC CACCTGGAGC AGTACGGCAC CAACGAGTAC 

S I I S SMK KHS LSIG S G W • 
2401 AGCATCATCA GCAGCATGAA GAAGCACAGC CTGAGCATCG GCAGCGGCTG 

•SVS LKGN NLI WTL KDSA
2451 GAGCGTGAGC CTGAAGGGCA ACAACCTCAT CTGGACCCTG AAGGATAGCG 

• GEV RQI TFRD LPD KFN 
2501 CCGGAGAGGT GAGACAGATC ACCTTCAGAG ACCTGCCCGA CAAGTTCAAT 

AYLA NKW VFI TITN D R L •
2551 GCCTACCTGG CCAACAAGTG GGTGTTCATC ACCATCACCA ACGACAGACT 
•SSA NLYI NGV LMG SAEI 
2601 GAGCAGCGCC AACCTGTACA TCAACGGCGT GCTCATGGGC AGCGCCGAGA 

•TGL G A I R E D N NIT LKL 
2651 TCACCGGCCT GGGCGCCATC AGAGAGGACA ACAACATCAC GCTGAAGCTG 
DRCN NNN QYV SIDK FRI 
2701 GACAGATGCA ACAACAACAA CCAGTACGTG AGCATCGACA AGTTCCGGAT
•FCK ALNP K E I EKL YTSY
2751 CTTCTGCAAG GCCCTGAACC CCAAGGAGAT CGAGAAGCTG TACACCAGCT

•LSI TFL RDFW GNP LRY 
2801 ACCTGAGCAT CACCTTCCTG AGAGACTTCT GGGGCAACCC CCTGAGATAC 

D T * (SEQ ID NO:45)
2851 GACACCTAG (SEQ ID NO: 21) 
MESP SAP PHR WCIP WQR LLL
1 ATGGAGAGCC CCAGCGCCCC CCCCCACCGC TGGTGCATCC CCTGGCAGCG CCTGCTGCTG 

TASL LTF WNP P T T A KLT IES 
61 ACCGCCAGCC TGCTGACCTT CTGGAACCCC CCCACCACCG CCAAGCTGAC CATCGAGAGC 

TPFN V A E GKE VLLL VHN LPQ 
121 ACCCCCTTCA ACGTGGCCGA GGGCAAGGAG GTGCTGCTGC TGGTGCACAA CCTGCCCCAG 

HLFG YSW YKG ERVD GNR QII 
181 CACCTGTTCG GCTACAGCTG GTACAAGGGC GAGCGCGTGG ACGGCAACCG CCAGATCATC 

GYVI GTQ QATPGPA YSG R E I 
241 GGCTACGTGA TCGGCACCCA GCAGGCCACC CCCGGCCCCG CCTACAGCGG CCGCGAGATC 

I Y P N ASL L I Q N I I Q NDT GFY 
301 ATCTACCCCA ACGCCAGCCT GCTGATCCAG AACATCATCC AGAACGACAC CGGCTTCTAC 

TLHV IKS DLV NEEA TGQ FRV 
361 ACCCTGCACG TGATCAAGAG CGACCTGGTG AACGAGGAGG CCACCGGCCA GTTCCGCGTG 

YPEL PKP SIS SNNS KPV E D K 
421 TACCCCGAGC TGCCCAAGCC CAGCATCAGC AGCAACAACA GCAAGCCCGT GGAGGACAAG 

DAVA FTC EPE TQDA TYL WWV 
481 GACGCCGTGG CCTTCACCTG CGAGCCCGAG ACCCAGGACG CCACCTACCT GTGGTGGGTG 

NNQS LPV SPR LQLS NGN RTL 
541 AACAACCAGA GCCTGCCCGT GAGCCCCCGC CTGCAGCTGA GCAACGGCAA CCGCACCCTG 

TLFN VTR NDT ASYK CET QNP 
601 ACCCTGTTCA ACGTGACCCG CAACGACACC GCCAGCTACA AGTGCGAGAC CCAGAACCCC 

VSAR RSD SVI LNVL YGP DAP 
661 GTGAGCGCCC GCCGCAGCGA CAGCGTGATC CTGAACGTGC TGTACGGCCC CGACGCCCCC 

TISP LNT SYR S GEN LNL SCH 
721 ACCATCAGCC CCCTGAACAC CAGCTACCGC AGCGGCGAGA ACCTGAACCT GAGCTGCCAC 

AASN PPA QYS WFVN GTF QQS 
781 GCCGCCAGCA ACCCCCCCGC CCAGTACAGC TGGTTCGTGA ACGGCACCTT CCAGCAGAGC

TQEL FIP NIT VNNS GSY TCQ
841 ACCCAGGAGC TGTTCATCCC CAACATCACC GTGAACAACA GCGGCAGCTA CACCTGCCAG

AHNS DTG LNR TTVT TIT VYA
901 GCCCACAACA GCGACACCGG CCTGAACCGC ACCACCGTGA CCACCATCAC CGTGTACGCC

EPPK PFI TSN NSNP VED EDA
961 GAGCCCCCCA AGCCCTTCAT CACCAGCAAC AACAGCAACC CCGTGGAGGA CGAGGACGCC

VALT CEP EIQ NTTY LWW VNN
1021 GTGGCCCTGA CCTGCGAGCC CGAGATCCAG AACACCACCT ACCTGTGGTG GGTGAACAAC 

QSLP VSP RLQ LSND NR T LTL 
1081 CAGAGCCTGC CCGTGAGCCC CCGCCTGCAG CTGAGCAACG ACAACCGCAC CCTGACCCTG 

LSVT RND VGP YECG IQN ELS 
1141 CTGAGCGTGA CCCGCAACGA CGTGGGCCCC TACGAGTGCG GCATCCAGAA CGAGCTGAGC 

VDHS DPV ILN VLYG PDD PTI 
1201 GTGGACCACA GCGACCCCGT GATCCTGAAC GTGCTGTACG GCCCCGACGA CCCCACCATC

SPSY TYY RPG VNLS LSC HAA 

1261 AGCCCCAGCT ACACCTACTA CCGCCCCGGC GTGAACCTGA GCCTGAGCTG CCACGCCGCC 

SNPP AQY SWL IDGN IQQ HTQ 

1321 AGCAACCCCC CCGCCCAGTA CAGCTGGCTG ATCGACGGCA ACATCCAGCA GCACACCCAG 

ELFI SNI TEK NSGL YTC QAN

1381 GAGCTGTTCA TCAGCAACAT CACCGAGAAG AACAGCGGCC TGTACACCTG CCAGGCCAAC 

NSAS GHS RTT VKTI TVS AEL 

1441 AACAGCGCCA GCGGCCACAG CCGCACCACC GTGAAGACCA TCACCGTGAG CGCCGAGCTG 

PKPS ISS NNS KPVE DKD A V A 

1501 CCCAAGCCCA GCATCAGCAG CAACAACAGC AAGCCCGTGG AGGACAAGGA CGCCGTGGCC

FTCE PEA QNT TYLW WVN GQS 

1561 TTCACCTGCG AGCCCGAGGC CCAGAACACC ACCTACCTGT GGTGGGTGAA CGGCCAGAGC 

LPVS PRL QLS NGNR TLT LFN

1621 CTGCCCGTGA GCCCCCGCCT GCAGCTGAGC AACGGCAACC GCACCCTGAC CCTGTTCAAC 

VTRN DAR A Y V CGIQ NSV SAN 

1681 GTGACCCGCA ACGACGCCCG CGCCTACGTG TGCGGCATCC AGAACAGCGT GAGCGCCAAC 

RSDP VTL DVL YGPD TPI ISP

1741 CGCAGCGACC CCGTGACCCT GGACGTGCTG TACGGCCCCG ACACCCCCAT CATCAGCCCC 

PDSS YLS GAN LNLS CHS ASN 

1801 CCCGACAGCA GCTACCTGAG CGGCGCCAAC CTGAACCTGA GCTGCCACAG CGCCAGCAAC 

PSPQ YSW RIN GIPQ QHT QVL 

1861 CCCAGCCCCC AGTACAGCTG GCGCATCAAC GGCATCCCCC AGCAGCACAC CCAGGTGCTG 

F I A K ITP NNN GTYA CFV SNL 

1921 TTCATCGCCA AGATCACCCC CAACAACAAC GGCACCTACG CCTGCTTCGT GAGCAACCTG 

ATGR NNS I V K SITV SAS GTS- 

1981 GCCACCGGCC GCAACAACAG CATCGTGAAG AGCATCACCG TGAGCGCCAG CGGCACCTCT

RKTH TCP PCP APEL LGG PSV 

2041 AGAAAGACCC ACACCTGCCC CCCTTGCCCT GCCCCTGAGC TGCTGGGCGG ACCCAGCGTG

FLFP PKP KDT LMIS RTP EVT

2101 TTCCTGTTCC CCCCCAAGCC TAAGGACACC CTCATGATCA GCAGAACCCC CGAGGTGACC

CVVV DVS HED PEVK FNW YVD
2161 TGCGTGGTGG TGGACGTGAG CCACGAGGAT CCCGAGGTGA AGTTCAACTG GTACGTGGAC

GVEV HNA KTK PREE QYN STY
2221 GGCGTGGAGG TGCACAATGC CAAGACCAAG CCCAGAGAGG AGCAGTACAA CAGCACCTAC

RVVS VLT VLH QDWL NGK EYK
2281 AGAGTGGTGA GCGTGCTCAC CGTGCTGCAC CAGGATTGGC TGAACGGCAA GGAGTACAAG 

CKVS NKA LPA PIEK TIS K A K 
2341 TGCAAGGTGA GCAACAAGGC CCTGCCTGCC CCCATCGAGA AAACCATCAG CAAGGCCAAG 

GQPR EPQ VYT LPPS RDE LTK
2401 GGCCAGCCCA GAGAGCCCCA GGTGTACACC CTGCCCCCTA GCAGAGATGA GTTGACCAAG

NQVS LTC LVK GFYP SDI AVE 
2461 AACCAGGTGA GCCTCACATG CCTGGTGAAG GGCTTCTACC CCAGCGACAT CGCCGTGGAG 

WESN GQP ENN YKTT PPV LDS 
2521 TGGGAGAGCA ACGGCCAGCC CGAGAACAAC TACAAGACCA CCCCCCCTGT GCTGGACAGC 

DGSF FLY SKL TVDK SRW QQG 
2581 GATGGCAGCT TCTTCCTGTA CAGCAAGCTC ACCGTGGACA AGAGCAGATG GCAGCAGGGC 

NVFS CSV MHE ALHN HYT QKS 
2641 AACGTGTTCA GCTGCAGCGT GATGCACGAG GCCCTGCACA ATCACTACAC CCAGAAGAGC 

LSLS PGK * (SEQ ID NO:46)
2701 CTGAGCCTGA GCCCCGGCAA GTAA (SEQ ID NO: 25) 
atggagtctc cctcggcccc tccccacaga tggtgcatcc cctggcagag gctcctgctc 60
acagcctcac ttctaacctt ctggaacccg cccaccactg ccaagctcac tattgaatcc 120 
acgccgttca atgtcgcaga ggggaaggag gtgcttctac ttgtccacaa tctgccccag 180 
catctttttg gctacagctg gtacaaaggt gaaagagtgg atggcaaccg tcaaattata 240 
ggatatgtaa taggaactca acaagctacc ccagggcccg catacagtgg tcgagagata 300 
atatacccca atgcatccct gctgatccag aacatcatcc agaatgacac aggattctac 360 
accctacacg tcataaagtc agatcttgtg aatgaagaag caactggcca gttccgggta 420 
tacccggagc tgcccaagcc ctccatctcc agcaacaact ccaaacccgt ggaggacaag 480 
gatgctgtgg ccttcacctg tgaacctgag actcaggacg caacctacct gtggtgggta 540 
aacaatcaga gcctcccggt cagtcccagg ctgcagctgt ccaatggcaa caggaccctc 600 
actctattca atgtcacaag aaatgacaca gcaagctaca aatgtgaaac ccagaaccca 660 
gtgagtgcca ggcgcagtga ttcagtcatc ctgaatgtcc tctatggccc ggatgccccc 720 
accatttccc ctctaaacac atcttacaga tcaggggaaa atctgaacct ctcctgccac 780 
gcagcctcta acccacctgc acagtactct tggtttgtca atgggacttt ccagcaatcc 840 
acccaagagc tctttatccc caacatcact gtgaataata gtggatccta tacgtgccaa 900 
gcccataact cagacactgg cctcaatagg accacagtca cgacgatcac agtctatgca 960 
gagccaccca aacccttcat caccagcaac aactccaacc ccgtggagga tgaggatgct 1020 
gtagccttaa cctgtgaacc tgagattcag aacacaacct acctgtggtg ggtaaataat 1080 
cagagcctcc cggtcagtcc caggctgcag ctgtccaatg acaacaggac cctcactcta 1140 
ctcagtgtca caaggaatga tgtaggaccc tatgagtgtg gaatccagaa cgaattaagt 1200 
gttgaccaca gcgacccagt catcctgaat gtcctctatg gcccagacga ccccaccatt 1260 
tccccctcat acacctatta ccgtccaggg gtgaacctca gcctctcctg ccatgcagcc 1320 
tctaacccac ctgcacagta ttcttggctg attgatggga acatccagca acacacacaa 1380 
gagctcttta tctccaacat cactgagaag aacagcggac tctatacctg ccaggccaat 1440 
aactcagcca gtggccacag caggactaca gtcaagacaa tcacagtctc tgcggagctg 1500 
cccaagccct ccatctccag caacaactcc aaacccgtgg aggacaagga tgctgtggcc 1560 
ttcacctgtg aacctgaggc tcagaacaca acctacctgt ggtgggtaaa tggtcagagc 1620
ctcccagtca gtcccaggct gcagctgtcc aatggcaaca ggaccctcac tctattcaat 1680 
gtcacaagaa atgacgcaag agcctatgta tgtggaatcc agaactcagt gagtgcaaac 1740 
cgcagtgacc cagtcaccct ggatgtcctc tatgggccgg acacccccat catttccccc 1800
ccagactcgt cttacctttc gggagcgaac ctcaacctct cctgccactc ggcctctaac 1860
ccatccccgc agtattcttg gcgtatcaat gggataccgc agcaacacac acaagttctc 1920
tttatcgcca aaatcacgcc aaataataac gggacctatg cctgttttgt ctctaacttg 1980
gctactggcc gcaataattc catagtcaag agcatcacag tctctgcatc tggaact (SEQ ID NO: 22)
1 MESPSAPPHR WCIPWQRLLL TASLLTFWNP PTTAKLTIES TPFNVAEGKE

51 VLLLVHNLPQ HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI 

101 IYPNASLLIQ NIIQNDTGFY TLHVIKSDLV NEEATGQFRV YPELPKPSIS 

151 SNNSKPVEDK DAVAFTCEPE TQDATYLWWV NNQSLPVSPR LQLSNGNRTL 

201 TLFNVTRNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP TISPLNTSYR 

251 SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ 

301 AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ 

351 NTTYLWWVNN QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS 

401 VDHSDPVILN VLYGPDDPTI SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL 

451 IDGNIQQHTQ ELFISNITEK NSGLYTCQAN NSASGHSRTT VKTITVSAEL 

501 PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS LPVSPRLQLS 

551 NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL YGPDTPIISP 

601 PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN 

651 GTYACFVSNL ATGRNNSIVK SITVSASGT (SEQ ID NO: 23) 
DSTP IPF SYS KNLD CWV DNE
1 GATTCAACAC CAATTCCATT TTCTTATTCT AAAAATCTGG ATTGTTGGGT TGATAATGAA 

EDID VIL K K S TILN L D I NND 
61 GAAGATATAG ATGTTATATT AAAAAAGAGT ACAATTTTAA ATTTAGATAT TAATAATGAT 

IISD ISG FNS SVIT YPD AQL 
121 ATTATATCAG ATATATCTGG GTTTAATTCA TCTGTAATAA CATATCCAGA TGCTCAATTG 

VPGI NGK A I H LVNN ESS EVI 
181 GTGCCCGGAA TAAATGGCAA AGCAATACAT TTAGTAAACA ATGAATCTTC TGAAGTTATA 

VHKA MDI EYN DMFN NFT VSF
241 GTGCATAAAG CTATGGATAT TGAATATAAT GATATGTTTA ATAATTTTAC CGTTAGCTTT 

WLRV PKV SAS HLEQ YGT NEY 
301 TGGTTGAGGG TTCCTAAAGT ATCTGCTAGT CATTTAGAAC AATATGGCAC AAATGAGTAT 

S I I S SMK K H S LSIG SGW SVS 
361 TCAATAATTA GCTCTATGAA AAAACATAGT CTATCAATAG GATCTGGTTG GAGTGTATCA 

LKGN NLI WTL KDSA GEV RQI 
421 CTTAAAGGTA ATAACTTAAT ATGGACTTTA AAAGATTCCG CGGGAGAAGT TAGACAAATA 

TFRD LPD K F N AYLA NKW VFI 
481 ACTTTTAGGG ATTTACCTGA TAAATTTAAT GCTTATTTAG CAAATAAATG GGTTTTTATA 
TITN DRL SSA N L Y I NGV LMG 
541 ACTATTACTA ATGATAGATT ATCTTCTGCT AATTTGTATA TAAATGGAGT ACTTATGGGA 

S A E I TGL G A I REDN NIT LKL 
601 AGTGCAGAAA TTACTGGTTT AGGAGCTATT AGAGAGGATA ATAATATAAC ATTAAAACTA 

DRCN NNN QYV SIDK FRI F C K 
661 GATAGATGTA ATAATAATAA TCAATACGTT TCTATTGATA AATTTAGGAT ATTTTGCAAA 

ALNP KEI EKL YTSY LSI TFL
721 GCATTAAATC CAAAAGAGAT TGAAAAATTA TACACAAGTT ATTTATCTAT AACCTTTTTA 

RDFW GNP LRY DTDR * (SEQ ID NO:48)
781 AGAGACTTCT GGGGAAACCC TTTACGATAT GATACAGATA GGTAG (SEQ ID NO:47)
1 ATGGAGTCTC CCTCGGCCCC TCCCCACAGA TGGTGCATCC CCTGGCAGAG GCTCCTGCTC

61 ACAGCCTCAC TTCTAACCTT CTGGAACCCG CCCACCACTG CCAAGCTCAC TATTGAATCC 

121 ACGCCGTTCA ATGTCGCAGA GGGGAAGGAG GTGCTTCTAC TTGTCCACAA TCTGCCCCAG 

181 CATCTTTTTG GCTACAGCTG GTACAAAGGT GAAAGAGTGG ATGGCAACCG TCAAATTATA 

241 GGATATGTAA TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGATA 

301 ATATACCCCA ATGCATCCCT GCTGATCCAG AACATCATCC AGAATGACAC AGGATTCTAC 

361 ACCCTACACG TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACTGGCCA GTTCCGGGTA 

421 TACCCGGAGC TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAAACCCGT GGAGGACAAG 

481 GATGCTGTGG CCTTCACCTG TGAACCTGAG ACTCAGGACG CAACCTACCT GTGGTGGGTA 

541 AACAATCAGA GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CAGGACCCTC 

601 ACTCTATTCA ATGTCACAAG AAATGACACA GCAAGCTACA AATGTGAAAC CCAGAACCCA 

661 GTGAGTGCCA GGCGCAGTGA TTCAGTCATC CTGAATGTCC TCTATGGCCC GGATGCCCCC 

721 ACCATTTCCC CTCTAAACAC ATCTTACAGA TCAGGGGAAA ATCTGAACCT CTCCTGCCAC 

781 GCAGCCTCTA ACCCACCTGC ACAGTACTCT TGGTTTGTCA ATGGGACTTT CCAGCAATCC 

841 ACCCAAGAGC TCTTTATCCC CAACATCACT GTGAATAATA GTGGATCCTA TACGTGCCAA 

901 GCCCATAACT CAGACACTGG CCTCAATAGG ACCACAGTCA CGACGATCAC AGTCTATGCA 

961 GAGCCACCCA AACCCTTCAT CACCAGCAAC AACTCCAACC CCGTGGAGGA TGAGGATGCT 

1021 GTAGCCTTAA CCTGTGAACC TGAGATTCAG AACACAACCT ACCTGTGGTG GGTAAATAAT 

1081 CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG ACAACAGGAC CCTCACTCTA 

1141 CTCAGTGTCA CAAGGAATGA TGTAGGACCC TATGAGTGTG GAATCCAGAA CGAATTAAGT 

1201 GTTGACCACA GCGACCCAGT CATCCTGAAT GTCCTCTATG GCCCAGACGA CCCCACCATT 

1261 TCCCCCTCAT ACACCTATTA CCGTCCAGGG GTGAACCTCA GCCTCTCCTG CCATGCAGCC 

1321 TCTAACCCAC CTGCACAGTA TTCTTGGCTG ATTGATGGGA ACATCCAGCA ACACACACAA 

1381 GAGCTCTTTA TCTCCAACAT CACTGAGAAG AACAGCGGAC TCTATACCTG CCAGGCCAAT 

1441 AACTCAGCCA GTGGCCACAG CAGGACTACA GTCAAGACAA TCACAGTCTC TGCGGAGCTG 

1501 CCCAAGCCCT CCATCTCCAG CAACAACTCC AAACCCGTGG AGGACAAGGA TGCTGTGGCC 

1561 TTCACCTGTG AACCTGAGGC TCAGAACACA ACCTACCTGT GGTGGGTAAA TGGTCAGAGC

1621 CTCCCAGTCA GTCCCAGGCT GCAGCTGTCC AATGGCAACA GGACCCTCAC TCTATTCAAT 

1681 GTCACAAGAA ATGACGCAAG AGCCTATGTA TGTGGAATCC AGAACTCAGT GAGTGCAAAC 

1741 CGCAGTGACC CAGTCACCCT GGATGTCCTC TATGGGCCGG ACACCCCCAT CATTTCCCCC 

1801 CCAGACTCGT CTTACCTTTC GGGAGCGAAC CTCAACCTCT CCTGCCACTC GGCCTCTAAC 

1861 CCATCCCCGC AGTATTCTTG GCGTATCAAT GGGATACCGC AGCAACACAC ACAAGTTCTC

1921 TTTATCGCCA AAATCACGCC AAATAATAAC GGGACCTATG CCTGTTTTGT CTCTAACTTG 

1981 GCTACTGGCC GCAATAATTC CATAGTCAAG AGCATCACAG TCTCTGCATC TGGAACTCTA 

2041 GATTCAACAC CAATTCCATT TTCTTATTCT AAAAATCTGG ATTGTTGGGT TGATAATGAA 

2101 GAAGATATAG ATGTTATATT AAAAAAGAGT ACAATTTTAA ATTTAGATAT TAATAATGAT 

2161 ATTATATCAG ATATATCTGG GTTTAATTCA TCTGTAATAA CATATCCAGA TGCTCAATTG 

2221 GTGCCCGGAA TAAATGGCAA AGCAATACAT TTAGTAAACA ATGAATCTTC TGAAGTTATA 

2281 GTGCATAAAG CTATGGATAT TGAATATAAT GATATGTTTA ATAATTTTAC CGTTAGCTTT 

2341 TGGTTGAGGG TTCCTAAAGT ATCTGCTAGT CATTTAGAAC AATATGGCAC AAATGAGTAT 

2401 TCAATAATTA GCTCTATGAA AAAACATAGT CTATCAATAG GATCTGGTTG GAGTGTATCA 
2461 CTTAAAGGTA ATAACTTAAT ATGGACTTTA AAAGATTCCG CGGGAGAAGT TAGACAAATA

2521 ACTTTTAGGG ATTTACCTGA TAAATTTAAT GCTTATTTAG CAAATAAATG GGTTTTTATA 

2581 ACTATTACTA ATGATAGATT ATCTTCTGCT AATTTGTATA TAAATGGAGT ACTTATGGGA

2641 AGTGCAGAAA TTACTGGTTT AGGAGCTATT AGAGAGGATA ATAATATAAC ATTAAAACTA 

2701 GATAGATGTA ATAATAATAA TCAATACGTT TCTATTGATA AATTTAGGAT ATTTTGCAAA 

2761 GCATTAAATC CAAAAGAGAT TGAAAAATTA TACACAAGTT ATTTATCTAT AACCTTTTTA 

2821 AGAGACTTCT GGGGAAACCC TTTACGATAT GATATAG (SEQ ID NO: 49) 
1 ATGGGCAGCC CCAGCGCCCC CCTGCACCGC TGGTGCATCC CCTGGCAGAC
CCTGCTGCTG ACCGCCAGCC TGCTGACCTT CTGGAACCCC CCCACCACCG

101 CCCAGCTGAC CATCGAGAGC CGCCCCTTCA ACGTGGCCGA GGGCAAGGAG
GTGCTGCTGC TGGCCCACAA CGTGAGCCAG AACCTGTTCG GCTACATCTG

201 GTACAAGGGC GAGCGCGTGG ACGCCAGCCG CCGCATCGGC AGCTGCGTGA
TCCGCACCCA GCAGATCACC CCCGGCCCCG CCCACAGCGG CCGCGAGACC

301 ATCGACTTCA ACGCCAGCCT GCTGATCCAC AACGTGACCC AGAGCGACAC
CGGCAGCTAC ACCATCCAGG TGATCAAGGA GGACCTGGTG AACGAGGAGG

401 CCACCGGCCA GTTCCGCGTG TACCCCGAGC TGCCCAAGCC CTACATCAGC
AGCAACAACA GCAACCCCGT GGAGGACAAG GACGCCGTGG CCCTGACCTG

501 CGAGCCCGAG ACCCAGGACA CCACCTACCT GTGGTGGGTG AACAACCAGA
GCCTGCCCGT GAGCCCCCGC CTGGAGCTGA GCAGCGACAA CCGCACCCTG

601 ACCGTGTTCA ACATCCCCCG CAACGACACC ACCAGCTACA AGTGCGAGAC
CCAGAACCCC GTGAGCGTGC GCCGCAGCGA CCCCGTGACC CTGAACGTGC

701 TGTACGGCCC CGACGCCCCC ACCATCAGCC CCCTGAACAC CCCCTACCGC
GCCGGCGAGA ACCTGAACCT GACCTGCCAC GCCGCCAGCA ACCCCACCGC

801 CCAGTACTTC TGGTTCGTGA ACGGCACCTT CCAGCAGAGC ACCCAGGAGC
TGTTCATCCC CAACATCACC GTGAACAACA GCGGCAGCTA CATGTGCCAG

901 GCCCACAACA GCGCCACCGG CCTGAACCGC ACCACCGTGA CCGCCATCAC
CGTGTACGCC GAGCTGCCCA AGCCCTACAT CACCAGCAAC AACAGCAACC

1001 CCATCGAGGA CAAGGACGCC GTGACCCTGA CCTGCGAGCC CGAGACCCAG
GACACCACCT ACCTGTGGTG GGTGAACAAC CAGAGCCTGA GCGTGAGCAG

1101 CCGCCTGGAG CTGAGCAACG ACAACCGCAC CCTGACCGTG TTCAACATCC
CCCGCAACGA CACCACCTTC TACGAGTGCG AGACCCAGAA CCCCGTGAGC

1201 GTGCGCCGCA GCGACCCCGT GACCCTGAAC GTGCTGTACG GCCCCGACGC
CCCCACCATC AGCCCCCTGA ACACCCCCTA CCGCGCCGGC GAGAACCTGA

1301 ACCTGAGCTG CCACGCCGCC AGCAACCCCG CCGCCCAGTA CAGCTGGTTC
GTGAACGGCA CCTTCCAGCA GAGCACCCAG GAGCTGTTCA TCCCCAACAT

1401 CACCGTGAAC AACAGCGGCA GCTACATGTG CCAGGCCCAC AACAGCGCCA
CCGGCCTGAA CCGCACCACC GTGACCGCCA TCACCGTGTA CGTGGAGCTG

1501 CCCAAGCCCT ACATCAGCAG CAACAACAGC AACCCCATCG AGGACAAGGA
CGCCGTGACC CTGACCTGCG AGCCCGTGGC CGAGAACACC ACCTACCTGT

1601 GGTGGGTGAA CAACCAGAGC CTGAGCGTGA GCCCCCGCCT GCAGCTGAGC
AACGGCAACC GCATCCTGAC CCTGCTGAGC GTGACCCGCA ACGACACCGG

1701 CCCCTACGAG TGCGGCATCC AGAACAGCGA GAGCGCCAAG CGCAGCGACC
CCGTGACCCT GAACGTGACC TACGGCCCCG ACACCCCCAT CATCAGCCCC

1801 CCCGACCTGA GCTACCGCAG CGGCGCCAAC CTGAACCTGA GCTGCCACAG
CGACAGCAAC CCCAGCCCCC AGTACAGCTG GCTGATCAAC GGCACCCTGC

1901 GCCAGCACAC CCAGGTGCTG TTCATCAGCA AGATCACCAG CAACAACAGC
GGCGCCTACG CCTGCTTCGT GAGCAACCTG GCCACCGGCC GCAACAACAG

2001 CATCGTGAAG AACATCAGCG TGAGCAGCGG CGACAGCTCT AGAAGCACCC
CCATCCCATT CAGCTACAGC AAGAACCTGG ACTGCTGGGT GGACAACGAG
2101 GAGGACATCG ACGTGATCCT GAAGAAGAGC ACCATCCTGA ACCTGGACAT
CAACAACGAC ATCATCAGCG ACATCAGCGG CTTCAACAGC AGCGTGATCA

2201 CCTACCCCGA CGCCCAGCTG GTGCCCGGCA TCAACGGCAA GGCCATCCAC
CTGGTGAACA ACGAGAGCAG CGAGGTGATC GTGCACAAGG CCATGGACAT

2301 CGAGTACAAC GACATGTTCA ACAACTTCAC CGTGAGCTTC TGGCTGAGAG
TGCCTAAGGT GAGCGCCAGC CACCTGGAGC AGTACGGCAC CAACGAGTAC

2401 AGCATCATCA GCAGCATGAA GAAGCACAGC CTGAGCATCG GCAGCGGCTG
GAGCGTGAGC CTGAAGGGCA ACAACCTCAT CTGGACCCTG AAGGATAGCG

2501 CCGGAGAGGT GAGACAGATC ACCTTCAGAG ACCTGCCCGA CAAGTTCAAT
GCCTACCTGG CCAACAAGTG GGTGTTCATC ACCATCACCA ACGACAGACT

2601 GAGCAGCGCC AACCTGTACA TCAACGGCGT GCTCATGGGC AGCGCCGAGA
TCACCGGCCT GGGCGCCATC AGAGAGGACA ACAACATCAC CCTGAAGCTG

2701 GACAGATGCA ACAACAACAA CCAGTACGTG AGCATCGACA AGTTCCGGAT
CTTCTGCAAG GCCCTGAACC CCAAGGAGAT CGAGAAGCTG TACACCAGCT

2801 ACCTGAGCAT CACCTTCCTG AGAGACTTCT GGGGCAACCC CCTGAGATAC
GACACCTAG (SEQ ID NO: 50)
1 MGSPSAPLHR WCIPWQTLLL TASLLTFWNP PTTAQLTIES RPFNVAEGKE

51 VLLLAHNVSQ NLFGYIWYKG ERVDASRRIG SCVIRTQQIT PGPAHSGRET 

101 IDFNASLLIH NVTQSDTGSY TIQVIKEDLV NEEATGQFRV YPELPKPYIS 

151 SNNSNPVEDK DAVALTCEPE TQDTTYLWWV NNQSLPVSPR LELSSDNRTL 

201 TVFNIPRNDT TSYKCETQNP VSVRRSDPVT LNVLYGPDAP TISPLNTPYR 

251 AGENLNLTCH AASNPTAQYF WFVNGTFQQS TQELFIPNIT VNNSGSYMCQ 

301 AHNSATGLNR TTVTAITVYA ELPKPYITSN NSNPIEDKDA VTLTCEPETQ 

351 DTTYLWWVNN QSLSVSSRLE LSNDNRTLTV FNIPRNDTTF YECETQNPVS 

401 VRRSDPVTLN VLYGPDAPTI SPLNTPYRAG ENLNLSCHAA SNPAAQYSWF 

451 VNGTFQQSTQ ELFIPNITVN NSGSYMCQAH NSATGLNRTT VTAITVYVEL 

501 PKPYISSNNS NPIEDKDAVT LTCEPVAENT TYLWWVNNQS LSVSPRLQLS 

551 NGNRILTLLS VTRNDTGPYE CGIQNSESAK RSDPVTLNVT YGPDTPIISP 

601 PDLSYRSGAN LNLSCHSDSN PSPQYSWLIN GTLRQHTQVL FISKITSNNS 

651 GAYACFVSNL ATGRNNSIVK NISVSSGDSS RSTPIPFSYS KNLDCWVDNE 

701 EDIDVILKKS TILNLDINND IISDISGFNS SVITYPDAQL VPGINGKAIH 

751 LVNNESSEVI VHKAMDIEYN DMFNNFTVSF WLRVPKVSAS HLEQYGTNEY 

801 SIISSMKKHS LSIGSGWSVS LKGNNLIWTL KDSAGEVRQI TFRDLPDKFN 

851 AYLANKWVFI TITNDRLSSA NLYINGVLMG SAEITGLGAI REDNNITLKL 

901 DRCNNNNQYV SIDKFRIFCK ALNPKEIEKL YTSYLSITFL RDFWGNPLRY 

951 DT* (SEQ ID NO:51)
1 ATGGGCAGCC CCAGCGCCCC CCTGCACCGC TGGTGCATCC CCTGGCAGAC
CCTGCTGCTG ACCGCCAGCC TGCTGACCTT CTGGAACCCC CCCACCACCG

101 CCCAGCTGAC CATCGAGAGC CGCCCCTTCA ACGTGGCCGA GGGCAAGGAG
GTGCTGCTGC TGGCCCACAA CGTGAGCCAG AACCTGTTCG GCTACATCTG

201 GTACAAGGGC GAGCGCGTGG ACGCCAGCCG CCGCATCGGC AGCTGCGTGA
TCCGCACCCA GCAGATCACC CCCGGCCCCG CCCACAGCGG CCGCGAGACC

301 ATCGACTTCA ACGCCAGCCT GCTGATCCAC AACGTGACCC AGAGCGACAC
CGGCAGCTAC ACCATCCAGG TGATCAAGGA GGACCTGGTG AACGAGGAGG

401 CCACCGGCCA GTTCCGCGTG TACCCCGAGC TGCCCAAGCC CTACATCAGC
AGCAACAACA GCAACCCCGT GGAGGACAAG GACGCCGTGG CCCTGACCTG

501 CGAGCCCGAG ACCCAGGACA CCACCTACCT GTGGTGGGTG AACAACCAGA
GCCTGCCCGT GAGCCCCCGC CTGGAGCTGA GCAGCGACAA CCGCACCCTG

601 ACCGTGTTCA ACATCCCCCG CAACGACACC ACCAGCTACA AGTGCGAGAC
CCAGAACCCC GTGAGCGTGC GCCGCAGCGA CCCCGTGACC CTGAACGTGC

701 TGTACGGCCC CGACGCCCCC ACCATCAGCC CCCTGAACAC CCCCTACCGC
GCCGGCGAGA ACCTGAACCT GACCTGCCAC GCCGCCAGCA ACCCCACCGC

801 CCAGTACTTC TGGTTCGTGA ACGGCACCTT CCAGCAGAGC ACCCAGGAGC
TGTTCATCCC CAACATCACC GTGAACAACA GCGGCAGCTA CATGTGCCAG

901 GCCCACAACA GCGCCACCGG CCTGAACCGC ACCACCGTGA CCGCCATCAC
CGTGTACGCC GAGCTGCCCA AGCCCTACAT CACCAGCAAC AACAGCAACC

1001 CCATCGAGGA CAAGGACGCC GTGACCCTGA CCTGCGAGCC CGAGACCCAG
GACACCACCT ACCTGTGGTG GGTGAACAAC CAGAGCCTGA GCGTGAGCAG

1101 CCGCCTGGAG CTGAGCAACG ACAACCGCAC CCTGACCGTG TTCAACATCC
CCCGCAACGA CACCACCTTC TACGAGTGCG AGACCCAGAA CCCCGTGAGC

1201 GTGCGCCGCA GCGACCCCGT GACCCTGAAC GTGCTGTACG GCCCCGACGC
CCCCACCATC AGCCCCCTGA ACACCCCCTA CCGCGCCGGC GAGAACCTGA

1301 ACCTGAGCTG CCACGCCGCC AGCAACCCCG CCGCCCAGTA CAGCTGGTTC
GTGAACGGCA CCTTCCAGCA GAGCACCCAG GAGCTGTTCA TCCCCAACAT

1401 CACCGTGAAC AACAGCGGCA GCTACATGTG CCAGGCCCAC AACAGCGCCA
CCGGCCTGAA CCGCACCACC GTGACCGCCA TCACCGTGTA CGTGGAGCTG

1501 CCCAAGCCCT ACATCAGCAG CAACAACAGC AACCCCATCG AGGACAAGGA
CGCCGTGACC CTGACCTGCG AGCCCGTGGC CGAGAACACC ACCTACCTGT

1601 GGTGGGTGAA CAACCAGAGC CTGAGCGTGA GCCCGCGCCT GCAGCTGAGC
AACGGCAACC GCATCCTGAC CCTGCTGAGC GTGACCCGCA ACGACACCGG

1701 CCCCTACGAG TGCGGCATCC AGAACAGCGA GAGCGCCAAG CGCAGCGACC
CCGTGACCCT GAACGTGACC TACGGCCCCG ACACCCCCAT CATCAGCCCC

1801 CCCGACCTGA GCTACCGCAG CGGCGCCAAC CTGAACCTGA GCTGCCACAG
CGACAGCAAC CCCAGCCCCC AGTACAGCTG GCTGATCAAC GGCACCCTGC

1901 GCCAGCACAC CCAGGTGCTG TTCATCAGCA AGATCACCAG CAACAACAGC
GGCGCCTACG CCTGCTTCGT GAGCAACCTG GCCACCGGCC GCAACAACAG

2001 CATCGTGAAG AACATCAGCG TGAGCAGCGG CGACAGCTCT AGAACCCCTC
AGAACATCAC CGATCTGTGC GCCGAGTACC ACAACACCCA GATCTACACC

2101 CTGAACGACA AGATCTTCAG CTACACCGAG AGCCTGGCCG GCAAGAGAGA
GATGGCCATC ATCACCTTCA AGAACGGCGC CATCTTCCAG GTGGAGGTGC 

2201 CCGGCAGCCA GCACATCGAC AGCCAGAAGA AGGCCATCGA GCGGATGAAG 
GACACCCTGC GGATCGCCTA CCTCACCGAG GCCAAGGTGG AGAAGCTGTG 

2301 CGTGTGGAAC AACAAGACCC CTCACGCCAT CGCCGCCATC AGCATGGCCA 
ATTGATAAG (SEQ ID NO: 52) 
1 MGSPSAPLHR WCIPWQTLLL TASLLTFWNP PTTAQLTIES RPFNVAEGKE

51 VLLLAHNVSQ NLFGYIWYKG ERVDASRRIG SCVIRTQQIT PGPAHSGRET

101 IDFNASLLIH NVTQSDTGSY TIQVIKEDLV NEEATGQFRV YPELPKPYIS

151 SNNSNPVEDK DAVALTCEPE TQDTTYLWWV NNQSLPVSPR LELSSDNRTL

201 TVFNIPRNDT TSYKCETQNP VSVRRSDPVT LNVLYGPDAP TISPLNTPYR

251 AGENLNLTCH AASNPTAQYF WFVNGTFQQS TQELFIPNIT VNNSGSYMCQ

301 AHNSATGLNR TTVTAITVYA ELPKPYITSN NSNPIEDKDA VTLTCEPETQ

351 DTTYLWWVNN QSLSVSSRLE LSNDNRTLTV FNIPRNDTTF YECETQNPVS

401 VRRSDPVTLN VLYGPDAPTI SPLNTPYRAG ENLNLSCHAA SNPAAQYSWF

451 VNGTFQQSTQ ELFIPNITVN NSGSYMCQAH NSATGLNRTT VTAITVYVEL

501 PKPYISSNNS NPIEDKDAVT LTCEPVAENT TYLWWVNNQS LSVSPRLQLS

551 NGNRILTLLS VTRNDTGPYE CGIQNSESAK RSDPVTLNVT YGPDTPIISP

601 PDLSYRSGAN LNLSCHSDSN PSPQYSWLIN GTLRQHTQVL FISKITSNNS

651 GAYACFVSNL ATGRNNSIVK NISVSSGDSS RTPQNITDLC AEYHNTQIYT

701 LNDKIFSYTE SLAGKREMAI ITFKNGAIFQ VEVPGSQHID SQKKAIERMK

751 DTLRIAYLTE AKVEKLCVWN NKTPHAIAAI SMAN** (SEQ ID NO: 53)